In statistics, probability theory, and information theory, a statistical distance quantifies the distance between two statistical objects, which can be two samples, two random variables, or two probability distributions, for example.
Contents |
A metric on a set X is a function (called the distance function or simply distance)
d : X × X → R
(where R is the set of real numbers). For all x, y, z in X, this function is required to satisfy the following conditions:
Many statistical distances are not metrics, because they lack one or more properties of proper metrics. For example, pseudometrics can violate the "positive definiteness" (alternatively, "identity of indescernibles" property); quasimetrics can violate the symmetry property; and semimetrics can violate the triangle inequality. Some statistical distances are referred to as divergences.
Some important statistical distances include the following:
Other approaches